Implementation Issues in the Design of I/O Intensive Data Mining Applications on Clusters of Workstations

Authors

  • Ranieri Baraglia
  • Domenico Laforenza
  • Salvatore Orlando
  • Paolo Palmerini
  • Raffaele Perego
Abstract

This paper investigates scalable implementations of out-of-core, I/O-intensive Data Mining (DM) algorithms on affordable parallel architectures, such as clusters of workstations. To validate our approach, the K-means algorithm, a well-known DM clustering algorithm, was used as a test case.

Similar resources

A High Performance Implementation of the Data Space Transfer Protocol (DSTP)

With the emergence of high performance networks, clusters of workstations can now be connected by commodity networks (meta-clusters) or high speed networks (super-clusters) such as the very high speed Backbone Network Service (vBNS) or Internet2’s Abilene. Distributed clusters are enabling a new class of data mining applications in which large amounts of data can be transferred using high perfo...

Network Capacity for Data Intensive Applications on Clusters of Workstations

Component software, distribution, and the use of clusters of workstations are all key trends in today's technology. Little attention has been paid, however, to the network bandwidth required for data intensive applications. In the context of databases, much work has been done in parallelization strategies for monolithic architectures with dedicated, specialized networks or over disk arrays. We envision...

2 System Model and Notation Client Client Client MIDDLEWARE

Component software, distribution, and the use of clusters of workstations are all key trends in today's technology. Little attention has been paid, however, to the network bandwidth required for data intensive applications. In the context of databases, much work has been done in parallelization strategies for monolithic architectures with dedicated, specialized networks or over disk arrays. We ...

Workshop on Large-Scale Parallel KDD Systems in conjunction with the 5th ACM SIGKDD International Conference on

With the emergence of high performance networks, clusters of workstations can now be connected by commodity networks (meta-clusters) or high speed networks (super-clusters) such as the very high speed Backbone Network Service (vBNS) or Internet2's Abilene. Distributed clusters are enabling a new class of data mining applications in which large amounts of data can be transferred using high perfo...

Application of international energy efficiency standards for energy auditing in a University buildings

This study seeks to provide insights on understanding the contemporary problems of energy efficiency in Ukrainian universities by developing a comprehensive energy efficiency management framework that encompasses its participating subjects, objects and key drivers along with suggesting its implementation mechanism and tools. Emphasis should be given that the current situation of inefficient and...

Publication date: 2000